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ABSTRACT 

The precision of cosmological parameters derived from galaxy cluster surveys is limited by uncertainty in re- 
lating observable signals to cluster mass. We demonstrate that a small mass-calibration follow-up program can 
significantly reduce this uncertainty and improve parameter constraints, particularly when the follow-up targets 
are judiciously chosen. To this end, we apply a simulated annealing algorithm to maximize the dark energy 
information at fixed observational cost, and find that optimal follow-up strategies can reduce the observational 
cost required to achieve a specified precision by up to an order of magnitude. Considering clusters selected 
from optical imaging in the Dark Energy Survey, we find that approximately 200 low-redshift X-ray clusters 
or massive Sunyaev-Zel'dovich clusters can improve the dark energy figure of merit by 50%, provided that the 
follow-up mass measurements involve no systematic error. In practice, the actual improvement depends on (1) 
the uncertainty in the systematic error in follow-up mass measurements, which needs to be controlled at the 
5% level to avoid severe degradation of the results; and (2) the scatter in the optical richness-mass distribution, 
which needs to be made as tight as possible to improve the efficacy of follow-up observations. 

Subject headings: cosmological parameters — cosmology: theory — galaxies: clusters — galaxies: halos — 
methods: statistical 



1. INTRODUCTION 

The dynamical properties of dark energy can be constrained 
with two phenomena. The first is the expansion of the uni- 
verse: dark energy has dominated the energy density of the 
universe for the past 4 billion years and has accelerated its ex- 
pansion. The second is the growth of structure; since dark en- 
ergy counteracts gravitational attraction, it slows the growth 
of structure. Galaxy cluster surveys explore both phenom- 
ena at the same time: the abundance and the correlation func- 
tion of galaxy clusters depend on expansion history and struc- 
ture growth, thus providing powerful probes of dark energy. 
Given the statistical power of ongoing and future surveys, 
galaxy cluste rs have become an i ndispensable probe of dark 
energy (e.g. IWang & Steinhardl fl998t lHaiman et all l200lt 



Holder et all 1200 It I Levine et al 1 l2(XMlHuff200T l Rozo et al.1 
2007allbL 120101 and references therein). 

Four cluster detection methods have been well estab- 
lishe d: the intracluster hot gas can be identified via X-ray 
(e.g.. lEbeling et al.1 119981 l2000l 120011 IVikhlinin et al. 1 [19981: 
iBohringer et al.1 120041) or Sunyaev-ZeFdovich (SZ ) effects 
(e.g.. IStaniszewski et aD l2009t iHincks et all 120091 see also 
ICarlstrom et al J 120021) : the mass conce ntrations can be iden- 
tified using weak lensing shear (e.g., IWittman et al.l l200ll 
120061) : or the galaxies in clusters can be identified in optical 
or infr ared surveys (e.g.. iPostman et all Il996t iKoester et all 
I2007t lEisenhardtetalJ 120081) . Large cluster surveys us- 
ing each method are ongoing or forthcoming, and cosmo- 
logical parameter constraints from cluster surveys have re- 
cently become competitive with other dark energy probes 
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(e.g.. iMantz et alj|2008t iHenrv et al J 120091: iRozo et aT]|20Tol 
IVikhlinin et al.ll2009bl) . 

The key issue for extracting cosmological information from 
clusters is the fidelity of the mass tracer. In a survey, the 
mass tracer can be self-calibrated by c ombining the infor ma- 
tion from counts and sample variance dLima &M2001 . In 
addition, if a sample or sub-sample of clusters is observed 
using multipl e methods, the cluster ma s s can be further cal- 
ibrated (e.g.. iMaiumdar & Mohii l200l 12004 ICunhal 120091: 
ICunha et al.ll2.009t) . Determining the most effective approach 
to improve the constraining power of clusters using multiple 
mass tracers is particularly timely, as multi-wavelength obser- 
vations will soon become available for large cluster samples. 

In this work, we focus on follow-up observations for opti- 
cal cluster surveys. We are particularly interested in how dark 
energy constraints from these surveys can be improved when 
a sub-sample of the clusters has better mass measurements 
from other methods, e.g., X-ray or SZ. Our goal is to char- 
acterize how follow-up observations should be designed and 
what precision is required in order to maximize dark energy 
information. 

Optical surveys identify massive clusters as agglomerations 
of galaxies. Since the physics of galaxy formation is much 
more complicated than the physics of hot intracluster gas, op- 
tical mass tracers are not as well understood as X-ray or SZ 
mass tracers. Nevertheless, the optical richness-mass distri- 
bution can be empirically determined, and precise cosmologi- 
cal parameters ha ve been derived from optically selected c lus- 
ter samples (e.g.. lGladders et al.ll2007t IRozo et al.ll2010l) . In 
the near future, optical surveys such as the Dark Energy Sur- 
vey (DES0), the Panoramic Survey Telescope & Rapid Re- 
sponse SysterrQ, and the Large Synoptic Survey Telescop^] 
will be able to identify substantially larger and higher-redshift 
cluster samples, which will significantly improve our knowl- 
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edge of dark energy equation of state w. 

In this work, we assume the statistical power and param- 
eters rele vant to PES , apply the self-calibration method pro- 
posed by iLima & Hul (120041) to calculate our fiducial cosmo- 
logical parameter constraints and explore how follow-up ob- 
servations will improve dark energy constraints. Throughout, 
we quantify dark energy constraints with the figure of merit 
(FoM) proposed in th e Report of the Dark Energy Task Force 
dAlbrecht et alj[2006h : 



FoM= 1/Vdet Cov(w ,w fl )= [cr(w a )cr(w p )Y 



(1) 



where w = wo + (l -a)w a and w p is calculated at the pivot red- 
shift for which w is best constrained. The cu rrent value o f the 
FoM from WMAP5, SNe, and BAO is 8.326 dWangl2008l) . As 
a reference, the DETF report predicts that the FoM from Stage 
III cluster surveys (of which DES is an example) will range 
from 6. 1 1 to 35.21, depending on the prior on the observable- 
mass distribution. 

Assuming that the observable-mass distribution follows 
power laws in both mass and redshift, we find that the FoM ex- 
pected from a DES-like survey is 15.5 using a self-calibrated 
analysis. We apply a simulated annealing algorithm to design 
follow-up strategies that maximize the FoM, starting with the 
limiting case in which follow-up mass measurements have in- 
finite precision. We then study how the FoM improvements 
depend on real-world complications, finding that the efficacy 
of follow-up observations is likely to be limited by the system- 
atic error in follow-up mass measurements. We finally con- 
sider observational issues and design different cost-effective 
f ollow-up strategies for X-ray and S Z. 

iMaiumdar & Mohri (120031 12004 have previously investi- 
gated how follow-up mass measurements of a fraction of the 
X-ray or SZ selected cluster sample can constrain the cluster 
evolution and improve dark energy constraints. Our major im- 
provement upon these studies is the optimization of follow-up 
strategies. We also include counts-in-cells and mass binning, 
and explore how scatter and possible systematic error of mass 
tr acers can affec t the efficacy of follow-up observations. 

iCunhal j2009f) has recently studied the joint analysis of over- 
lapping optical and SZ cluster surveys. By studying the clus- 
ter abundances in both surveys, the observable-mass distribu- 
tion can be cross-calibrated. In contrast to this study, we con- 
centrate on one survey and its follow-up observations. That 
is, our method does not require another complete survey but 
instead focuses on a small and optimized follow-up program. 

This paper is organized as follows. We briefly review dark 
energy constr aints from clusters and the self-calibration anal- 
ysis in Section l2~T1an d describe our survey and model assump- 
tions in Section 12.21 We pr esent the modeling for follow-up 
observa tion s in Section lXTl and the optimization procedure in 
Section [3~2l Real-world complications of follow-up observa- 
tions are explored in Section [4] Optimizations considering 
observational issues specific to X-ray and SZ clusters are car- 
ried out in Section We discuss other relevant studies in 
Section[6]and conclude in Section|7] 

2. DARK ENERGY CONSTRAINTS FROM GALAXY CLUSTERS 
2.1. Self-calibration: A Review 

In this section, w e briefly review the self-calibration for- 
malism developed bv lLima & Hul (120041). For detailed discus - 
sions, we refer the read er to ILima & Hul d2004 120051 120071) . 
IHu & Cohnl (I20061) . and lWu et all 120081 

In a galaxy cluster survey, one studies how cluster counts 



depend on some cluster mass proxy to infer the dark matter 
halo mass function, which provides constraints on the prop- 
erties of dark energy. To infer the mass function from data, 
one needs to relate the observable properties of clusters to 
the halo mass; thus, the uncertainty in this observable-mass 
distribution limits the constraining power of clusters. The 
observable-mass distribution can be self-calibrated using a 
counts-in-cells analysis, in which the survey volume is di- 
vided into small cells and the halo bias can be calculated 
from the sample variance among the cells. By measuring both 
counts and sample variance, the mass function and the halo 
bias are fit simultaneously, the observable-mass distribution 
can be self-calibrated, and the dark energy constraints can be 
improved. 

Let us consider a cell of volume V c in a narrow redshift 
range in a survey. We denote the cluster mass proxy by M b s 
and further bin our cluster sample in M b s with a binning func- 
tion 0(lnM o b s ), which equals unity inside the bin and zero 
elsewhere. In this bin, the mean cluster counts (m) and the 
mean cluster bias (b) can be calculated from the mass func- 
tion dn/dlnM, the halo bias b(M), and the observable-mass 
distribution P(lnM obs | InM): 



dn 
dlnM 

= — / dlnM-^- 
m J dlnM 



m = V c / d\nM 



InM) , 
b(M)((j)\lnM) 



(2) 
(3) 



where 



InM) = / d\nM obs P(lnM obs | lnM)rA(lnM obs ) . (4) 



We assume the redshift bin is thin enough and simply use 
the function values at the middle redshift instead of averag- 
ing over the redshift bin._ 

The mean cluster bias b determines the number fluctuations 
among cells due to the large-scale clustering. If cell ; has clus- 
ter counts nij and bias the corresponding sample variance 
among the cells is given by 



Sij = {(mj - mj)(m j -mj)) 
= mimjbjbj(j\ c , 



(5) 



where er v is the variance of the dark matter density fluctua- 
tion in volume V c . We assume that the cell volume is large 
enough for the covariance between cells to be negligible. 

In a survey, one observes m,- and Sy, calculates b,, and self- 
calibrates P(lnM b s | InM) based on Equations|2]and[3] We use 
matrix notations m and S to indicate the data in multiple mass 
and redshift bins and further define C = diag(m) + S. The con- 
straints on model parameters can be obtained from the Fisher 
matrix 

F aP = m^C'm /3 + iTrtC-'S^C-'S^] , (6) 

where the comma followed by a subscript indicates the partial 
derivative with respect to a model parameter, and the deriva- 
tives are with respect to cosmological parameters and the pa- 
rameters characterizing the observable-mass distribution. We 
invert this Fisher matrix to obtain the covariance matrix and 
constraints on model parameters. 

The second term in this Fisher matrix characterizes the in- 
formation from the sample variance. Since the sample vari- 
ance depends on both cosmology and observable-mass distri- 
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bution, this "noise" in cluster coun ts actually provides "sig- 
nal." As we will see in Section IXTl an analogous Fisher ma- 
trix is used to calculate the constraints from follow-up obser- 
vations. In those observations, the variance in follow-up mass 
measurements will provide critical information for breaking 
the degeneracies between model parameters. 

2.2. Survey and Model Assumptions 
In this work, we consider a DES-like optical survey with a 
survey area of 5000 deg 2 , and a survey depth such that clusters 
with redshift z < 1 are robustly identified. For the counts-in- 
cells analysis, the survey volume is divided into cells of area = 
lOdeg 2 and Az = 0. 1 (see. e.g.. lLima & Hul2004l) . We assume 
that the mass threshold for the survey is M Q \, S = 10 13,7 /T 1 M Q , 
and that the cluster sample is binned by M b s into eight bins 
with logarithmic bin size Alog 10 M o b s = 0.2. Note that the 
mass threshold and binning are based on the mass proxy M b s 
rather than the true cluster mass M. We have tested the impact 
of the binning on our results and found that this binning is fine 
enough that we do not lose information from the data. Finer 
binning is also unnecessary because the scatter between the 
observable and mass sets an effective mass resolution. 

For the observable-mass distribution P(lnM b s | InM), we 
assume a Gaussian distribution with mean (lnM+lnMbi as ) and 
variance cr 2 bs . Both lnMbi as and cr 2 bs are assumed to vary lin- 
early with InM and ln(l +z) via 

lnMbias = lnM + a M ln(M/M pivot ) + a z ln( 1 + z) (7) 



fobs = Co + Pm ln(M/M pivot ) + j3 z ln(l + z) , 



(8) 



giving nuisance parameters: (lnMo,aM,ce z ,al, (3 m , /?,). For 
our fiducial model, we assume an unbiased and non-evolving 
observable-mass distribution with <tq = 0.5 (all other param- 
eters are set to zero), whic h is consist e nt wit h the results in 
iGladders et all (120071) and iRozo et al.1 (120101) . In addition, 
Mpi vot should be close to the mass scales of interest; we choose 
Mpi vot = 10 13 7 /j _1 Mq, noting that the precise value of M p i vot 
does not affect our results. 

Throughout this work, we assume the Wilkinson Microwave 
Anis otropy Probe Five Ye ar (WMAP5) cosmological parame- 
ters dKomatsu et al.l2009l : wq = -1 , w a = 0, fi DE = 0.726, f\ = 
0, fl m h 2 = 0.136, n h h 2 = 0.0227, n s = 0.960, A c = 4.54 x 
10~ 5 at k = 0.05Mpc _1 ), and use the Planck-prior Fisher matrix 
provided by W. Hu and Z. Ma. We use the linear m atter power 
spectrum calculated by CAMB dLewis et al.ll2000l) . The halo 
mass is defined with spherical overdensity of A = 200 with 
respect to the mean matter densit y , and we use the updated 



mass functio n from iTinker et a . (120081) 



„ and the halo bias 
function from lSheth et al.1 (1200 M Under these assumptions, 



we expect that a DES-like survey will observe approximately 
2 x 10 5 clusters in total. 

3. IMPROVING DARK ENERGY CONSTRAINTS WITH OPTIMAL 
FOLLOW-UP STRATEGIES 

3.1. Constraints from Follow-up Mass Measurements 

In this section, we calculate the additional constraints from 
follow-up observations. Given the optical cluster sample from 



8 An updated halo bias function is available in Tinker et al. (2010); the 
difference between the two functions does not significantly affect our results. 
We also note that the mass function and the halo bias function we use are not 
in a consistent framework; since we only use their dependence on cosmology, 
this inconsistency is not important. 



a DES-like survey, we select some clusters from each mass 
and redshift bin to follow up — for example, to measure their 
properties in X-ray or SZ — and estimate the cluster mass 
more precisely. These follow-up mass measurements provide 
further constraints on the observable-mass distribution, thus 
improving the dark energy constraints of the original survey. 
Throughout Section [3] we assume the follow-up observations 
provide mass measurements with infinite precision; the com- 
plications of follow-up mass tracers will be explored in Sec- 
tion H 

Our goal is to constrain the scaling relation and the scat- 
ter of optical richness. We note that the term "scatter" usu- 
ally has different meanings in theoretical and observational 
contexts. In the theoretical model, the scatter of lnM b s at 
fixed InM (er b s in our notation) is used because the model is 
based on the mass function, which is a function of M. In con- 
trast, one observationally constrains the scatter of InM at fixed 
lnMn hs because follow- ups are selected based on their M b s 
(e.g.. lRozcT et al. 2009bj). In general, the theoretical model is 
based on P(lnM b s | InM), while observations put constraints 
on P(lnM| lnM b s ). Our goal is therefore to convert con- 
straints on the latter distribution to constraints on the former. 

Let us return to the original survey, focusing on the follow- 
up observations in a bin specified by 0(lnM o b s ) in a narrow 
redshift range. If the follow-up mass Mf perfectly recovers 
the true mass, the mean and variance of InMf will read 



(InMf) =£[lnM f |</>(lnM ob s)] 

/dn 
<ilnMlnM— < 
dlnM 



| InM) , 



(9) 



and 



V = Var[lnM f |0(lnM obs )] 



x 



<ilnM(lnM-(lnM f )) 2 



dn 
d\nM { 



| InM) . (10) 



The information from a single follow-up mass measure- 
ment is given by an analog of Equation [6] 

= (InMf) ^V- 1 (lnM f ) ^+^V- 1 V a V- 1 V tfi , (11) 

where the superscript (;) indicates the mass and redshift bin 
from which we select follow-ups. If we follow up Ni clusters 
in bin i, the Fisher matrix for the whole follow-up program 
reads 

F aP = Y,NiF%. (12) 



This Fisher matrix is added to the Fisher matrix of counts- 
in-cells (Equation [6} to improve the constraints on nuisance 
parameter^. Note that the second term in Equation [TT1 plays 
the role of "noise as signal" as in the case of counts-in-cells, 
characterizing the information included in the mass variance 
of follow-ups. 

Figure QJ shows how follow-up observations improve the 
dark energy FoM (defined in Section 1 ). We calculate the ratio 

9 Technical note. In principle, when we calculate the Fisher matrix F a p, 
the derivatives should include cosmological parameters. However, to cor- 
rectly model the cosmological information in follow-up observations, we 
need to include the covariance between the follow-up observations and the 
original cluster survey. For simplicity, we ignore this covariance and only 
consider the information for nuisance parameters, noting that the cosmologi- 
cal information is negligible for up to 1000 follow-ups. 
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FIG. 1 . — Improvement in the dark energy FoM due to follow-up observa- 
tions, relative to the fiducial value (FoMfy) for a DES-like survey. The black 
solid curve corresponds to optimal follow-up strategies that maximize the 
FoM; the red dashed curve assumes that the follow-ups are evenly selected 
from all mass and redshift bins; the blue dotted curve also assumes even 
selection, ignoring the variance of the follow-up mass measurements. The 
follow-ups can significantly improve the FoM, especially when the target se- 
lection is optimized and when the variance of follow-up mass measurements 
is included. 

between the improved FoM and the fiducial FoM from a DES- 
like survey (without follow-ups) and demonstrate how this ra- 
tio improves as the number of follow-up mass measurements 
increases. For the two lower curves, we assume that follow- 
up targets are evenly selected across all bins (i.e., Ni = iVf/80, 
including eight bins in M D b s and ten bins in redshift). We also 
limit the number of follow-ups in each bin by the number of 
clusters that a DES-like survey is expected to observe. 

The red dashed curve shows the improvement of the FoM 
using the information from both the mean and the variance 
of follow-up mass measurements (i.e., both terms in Equa- 
tion n~TT >. As can be seen, the follow-ups can substantially 
improve the FoM; for example, for 100 follow-up mass mea- 
surements, the FoM can be improved by 40.3%. For com- 
parison, the blue dotted curve shows the improvement in the 
FoM using the information only from the mean of follow-up 
mass measurements (i.e., only the first term in Equation [TTJ. 
As can be seen, lacking the information from the variance can 
substantially reduce the effectiveness of follow-ups. This case 
may be relevant to stacked cluster samples which provide the 
mean but not the variance of cluster mass. Finally, the black 
solid curve corresponding to the most significant improve- 
ment is based on the optimal follow-up strategies, which will 
be discussed in the next section. 

In addition to evenly selected follow-up targets, we also in- 
vestigate the case of selecting a fixed fr action of the cluster 
sample in each bin, which is assumed in iMajumdar & Mohrl 
( 12003b . This selection gives slightly lower FoM values than 
our evenly selected follow-ups; the reason will become clear 
in the next section. 

3.2. Optimizing Follow-up Strategies: A Simulated 
Annealing Approach 

Instead of selecting the same number of follow-up targets 
in all bins, in this section, we explore how we can further 
improve the FoM by optimally selecting the follow-ups. We 
pose the following optimization problem: for a fixed number 
of follow-up measurements, Ni, how should we select follow- 
up targets to maximize the FoM? 

We use a slight variant of the Metropolis-based simulated 
annealing algorithm to find the optimal follow-up strategy. 



The idea of simulated annealing comes from a physical phe- 
nomenon: when a metal cools slowly, its atoms will rearrange 
to achieve the minimum of internal energy. This rearrange- 
ment sometimes moves toward higher-energy states due to 
thermal fluctuations, which allow the system to overcome en- 
ergy barriers separating different local minima. As the system 
cools, these thermal fluctuations become rare; provided the 
cooling process is slow enough, the system will eventually 
approach its global energy minimum. 

The simulated annealing algorithm is suitable for our prob- 
lem because our problem is combinatorial and the total num- 
ber of configurations is factorially large. (If we use 80 bins, 
there are 10 52 configurations for Nf= 100 and 10 1 21 configu- 
rations for Nf = 1000.) Following the guidelines in Pr ess et al.l 
d200l . we design our algorithm as follows: 

1 . The system configuration is characterized by the num- 
ber of follow-up targets in each mass and redshift bin, 
N = (Ni, ...,Ngo). (We use eight bins in M bs and ten bins 
in redshift.) We note that the general configurations do 
not change with the detail of binning. 

2. The rearrangement of clusters in bins, or the transition 
from one configuration to another, is designed as fol- 
lows: given an initial configuration, for each "donor " 
bin i, we randomly pick a "receiver" bin j and transfer 
"trans clusters to the receiver bin. Here j is randomly 
chosen from all available bins, and «tr ans is a random 
integer between and «ii m it- We choose n^t to be 1% 
of the total number of follow-ups Nf, for example, if 
iVf = 100, we transfer or 1 cluster at a time. We also 
require the number of follow-ups in each bin to be be- 
tween zero and the number of clusters expected to ob- 
served in that bin. After running i through all bins, i.e., 
letting each bin play the role of donor once, we reach a 
new nearby configuration. 

3. The objective function is the FoM, which we are try- 
ing to maximize by sampling different configurations. 
Applying the idea of the Metropolis algorithm, we start 
from the current configuration (Ni, with the FoM value 
/•j) and sample a nearby configuration (N t ,y, with the 
FoM value F try ). If ftry > F\, N t ,y is accepted at this step; 
that is, we set N;+i = Nay. If F ay < Fj, N try is accepted 
with probability exp[{F tly - F{) /T]. Here T is the "tem- 
perature" parameter that determines the probability of 
moving to a smaller FoM value (analogous to thermal 
fluctuations). 

4. To design our annealing schedule, we start with a T 
value that roughly gives an acceptance rate of 0.2; this 
rate empirically indicat es a fair sampling of the config- 
uration space (see, e.g jDunklev et al.l20 05 for the case 
of Markov chain Monte Carlo). After running 10 4 iter- 
ations with this temperature, we lower the temperature 
by 10% and run 4000 iterations as one step of the an- 
nealing. We repeat this annealing procedure between 
10 and 40 times (depending on the size of the configu- 
ration space) until the improvement in the FoM is neg- 
ligible and the system "entropy" is low in the sense that 
clusters are concentrated in a few bins. 

Since our configuration space is factorially large, we reduce 
its size to facilitate the sampling. We start with only 40 bins 
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FIG. 2. — Optimal follow-up strategies that maximize the FoM at a given 
number of follow-up observations JVf , calculated with a simulated annealing 
algorithm. Each pixel corresponds to one mass and redshift bin in the optical 
cluster survey, and the corresponding number of follow-ups in this bin is 
shown. Upper: Nf = 10; lower: Nf = 100. As can be seen, low-redshift bins 
are highlighted because they provide complementary information to the CMB 
prior. In addition, the follow-up strategies prefer the corners, because these 
bins provide the longest lever amis for constraining the mass and redshift 
dependence of the observable-mass distribution. 



by doubling the bin size in M oi , s and determine where the rel- 
evant bins are. We then return to our original binning and 
exclude irrelevant bins to reduce the size of the configuration 
space. 

Regarding the convergence of our algorithm, we note that 
although the simulated annealing algorithm will eventually 
converge to the global optimum, pursuing this convergence 
is impractical due to the extremely large configuration space. 
We use a sufficiently high temperature at the beginning to en- 
sure that the configuration space is fairly sampled, but we can- 
not guarantee that the global optimum is found at the end. 
However, after testing several different initial configurations, 
we find that various local optima are very close to each other. 
Each of these optima provides significant improvement when 
compared to the evenly selected follow-ups. We thus expect 
that our method provides a good solution to the optimization 
problem. 



Figure [2] illustrates two examples of our optimization; the 
upper panel corresponds to ten follow-ups in total, Nf = 10, 
and the lower panel corresponds to yVf = 100. For Nf = 10, 
the optimal configuration focuses on the highest and the low- 
est mass bins at low redshift. This configuration reflects the 
fact that given the tight CMB prior, most of the constraining 
power of a cluster survey lies in the low-redshift cluster sam- 
ple, which provides the longest possible lever arm for con- 
straining the evolution in dark energy. This optimal follow-up 
strategy improves the FoM by 16.1% relative to the fiducial 
FoM of the survey. For Nf = 100, high-redshift clusters start to 
become important, and the improvement in the FoM is 76.4%, 
compared to 40.3% for uniform sampling. We note that in the 
highest-mass and lowest-redshift bins, clusters are very rare 
and we reach the limit of following up all clusters in these 
bins. 

We can see that the optimal follow-up strategies always se- 
lect clusters in the most extreme bins — only the corners of the 
mass and redshift bins shown in Figure [2] are populated. This 
result reflects our assumption that the observable-mass distri- 
bution depends on mass and redshift via power laws, which 
are well constrained by the extremes. These strategies also 
imply that following up a random fraction of the cluster cat- 
alog is inefficient as a way to improve the FoM, since typical 
clusters are in the low-mass and mid-to-high redshift regime. 

In practice, following up clusters in the highest-redshift 
and lowest-mass bin is impractical, if not impossible. Con- 
sequently, we explore how superior this bin is relative to its 
neighbors. We find that this bin is not significantly better than 
other high-redshift and low-mass bins. The lowest-redshift 
bins, on the other hand, are always the most essential to fol- 
low up. Thus, for designing a practical follow-up program, 
one should always prioritize the lowest-redshift bins and try 
to extend to the high-redshift and low-mass regime. We will 
further explore the detectability and observational cost issues 
to find the most cost-effective strategies in Section|5]. 

The improvement in the FoM due to the optimization is 
shown as a black solid curve in FigureQ] We select various Nf 
values and apply the optimization algorithm to maximize the 
FoM. Compared to the case of uniformly sampled follow-ups, 
the optimization can further improve the FoM. 

For an interesting comparison, we note that lFrieman et alJ 
(120031) optimized the supernova survey strategy to minimize 
the errorbar on w. They demonstrated different survey strate- 
gies to complement different CMB priors. We also note that 
iParkinson et al.l (120071 1201 Oh applied simulated annealing to 
the design of baryon acoustic oscillation surveys. 

4. REQUIREMENTS FOR THE FOLLOW-UP MASS PROXY 
4.1. Scatter and Covariance of the Mass Proxies 

So far we have assumed that follow-up observations can re- 
cover the true mass precisely; i.e., Mf = M. In reality, Mf itself 
is a mass proxy and has a scatter <7f around the true mass M. 
Moreover, this scatter may correlate with o- b s , the scatter of 
M bs around M. Therefore, a proper analysis of the effects of 
follow-up observations should include the observable-follow- 
up-mass distribution P(lnM ODS , InMf | InM). 

We assume that the follow-up mass tracer correlates with 
the true mass more tightly than the survey mass tracer does; 
for example, X-ray and SZ mass proxies h ave lower scat- 
ter th an any known optical mass proxy (e.g.. iRravtsov et aT] 
120061) . We have assumed cr b s = 0.5 throughout our analysis; 
we further assume that <7f = 0. 1 and that the two mass proxies, 
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FIG. 3. — Impact of the scatter and the correlation of mass proxies on the 
effectiveness of optimized follow-up observations. Here we compare three 
sets of assumptions about cr b s and of ; for each set we show the cases of fixed 
(solid curves) and marginalized (dashed curves) p and of. When of <C cr bs> 
the FoM is almost unaffected by marginalization, as shown in (1). When of 
is large, the effectiveness of follow-ups is degraded, as shown in (2). On the 
other hand, when the survey has small scatter (small o b s ), follow-ups can 
significantly improve the FoM, as shown in (3). 

lnM b s and InMf, have a correlation coefficient p. No prior 
knowledge is assumed about p. 

Let us assume that the observable-follow-up-mass distri- 
bution P(lnM bs,lnMf| InM) is a bivariate Gaussian distribu- 
tion in (lnM b s - lnM) and (InMf-lnM) with the covariance 
matrix 

\ pa obs a f of J ' 
The mean of InMf is given by 

£[lnMf|0(lnMobs)] oc [ dhvM—^— [ <flnMflnM f 
J d\nM J 

J lnM obs 0(lnM obs )P(lnM obs ,lnM f |lnM) . (13) 

The variance of InMf can be calculated similarly. Since this 
variance involves both o" b s and Of, in the limit of <C of bs , we 
expect the variance of InMf to be dominated by of bs . On the 
other hand, if o"f is larger (i.e., the follow-up mass measure- 
ments have larger intrinsic scatter), the resulting mass vari- 
ance of follow-ups will be larger and the FoM improvement 
will be less significant. 

We find that different values of p barely affect the resulting 
FoM: positive p slightly improves the FoM, while negative p 
slightly degrades the FoM. When compared to positive p, neg- 
ative p widens the mass range of the follow-ups (with larger 
variance in InMf) and has slightly less constraining power. 
The reason is that when the correlation is negative, er b s and 
erf tend to scatter M D b s and Mf to opposite directions, and clus- 
ter samples selected based on M b s will have larger variance 
in Mf. Since the differences are small, we only show the case 
of a zero correlation for demonstration. 

We allow the scatter Of and the correlation coefficient p 
to be additional free parameters in the Fisher matrix (Equa- 
tion [12]| and study their impact. In Figure [3] we explore how 
different assumptions of scatter affect the FoM improve men t 
(based on the optimal configurations found in Section I3.21 i. 
Each pair of solid and dashed curves corresponds to a set of 
assumptions about o- b s and af, the solid curve corresponds to 
fixed Of and p in the Fisher matrix, while the dashed curve 
corresponds to marginalized Of (with a prior of 0.05 on of) 



and p (with no prior). 

The two black curves in the middle, labeled as (1), corre- 
spond to our baseline assumption about the scatter: o- b s = 0.5 
and Of = 0.1. As can be seen, the uncertainty in p barely de- 
grades the FoM. This minor effect is due to the small af, given 
such a small scatter, the follow-up mass will closely follow the 
real mass even for an uncertain correlation. 

On the other hand, if Of is large, the follow-ups have large 
intrinsic scatter and provide less constraining power. We 
demonstrate the degradation with the two blue curves at the 
bottom, labeled as (2), which correspond to a high scatter in 
follow-ups: Of = 0.4. Comparing the blue solid curve to the 
black solid curve, we can see that the FoM is degraded due 
to the larger scatter in follow-ups. If we further consider the 
uncertainty in p, the blue dashed curve shows stronger degra- 
dation than the baseline case. 

We also explore the effect of o" b s - The two red curves at 
the top, labeled as (3), correspond to the case of a low scatter 
in lnM b s : CT bs = 0.2. Comparing the red solid curve to the 
black solid curve, we can see that the FoM is significantly 
improved due to the smaller mass variance of the follow-ups. 
Since the follow-ups are selected by M b s , lowering o- b s leads 
to follow-ups with a less spread in mass and provides better 
constraints. The red dashed curve shows that marginalizing 
over Of and p only modestly degrades this result, since in this 
regime both mass tracers have very high fidelity. 

We note that the FoM improvement due to a small CT D b s 
comes from the follow-ups rather than self-calibration. When 
we lower a b s from 0.5 to 0.2, the fiducial FoM from self- 
calibration (without follow-ups) barely changes. This result 
reflects the fact that simply reducing the value of the scatter is 
not as effective as improving the constraints on the scatter. 

We also note that Of and p are assumed to be independent of 
mass and redshift; detailed properties of o-f and p are beyond 
the scope of this work. However, possible dependence of 
these parameters on mass and redshift, if not well constrained , 
may severely degrade the FoM (see, e.g. JSahlen et al.ll2009l) . 

In summary, we have found that as long as at is sufficiently 
small, the effects of Of and p are negligible, and the uncer- 
tainty in p has only a modest impact on the efficacy of the 
follow-up observations. 

4.2. Systematic Error in Follow-up Mass Measurements 

In this section, we ignore the scatter in the follow-ups and 
focus on the possibility that follow-up mass measurements 
systematically deviate from the true mass by a constant factor 
d. The mean of InMf takes the form 

£[lnM f |0(M obs )] = lnrf+£[lnM|0(M obs )]. (14) 

The parameter d characterizes the average systematic error of 
the follow-up mass measurements and has no impact on the 
variance of InMf. We include Ind as an additional nuisance 
parameter in the Fisher matrix (EquationfT2l and study its im- 
pact on the FoM. We are looking for the required constraints 
on Ind to avoid severe degradation of the FoM. 

When comparing Equations P71 and [l4l we note that lnMbi as 
and Ind are completely degenerate in determining £[lnMf |</>]. 
On the other hand, since ln<i does not affect the variance of 
InMf in a bin (Equation ITOb. this variance can provide infor- 
mation for lnMbi as an d break the degeneracy. Here we demon- 
strate again the importance of the mass variance in follow-up 
observations. 

Figure [4] shows how the systematic error degrades the ef- 
ficacy of our optimal follow-up strategies. We set the fidu- 
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FIG. 4. — Impact of systematic error in follow-up mass measurements on 
our optimal follow-up results. We assume that the follow-up mass measure- 
ments systematically deviate from the true mass by a constant factor d and 
compare different prior constraints on \nd. Uncertainty in d can substantially 
degrade the efficacy of follow-ups; for about 200 follow-ups, a degradation 
of less than 10% requires cr(lnd) < 0.05. 

cial value of Ind to be and compare different prior con- 
straints on lnii. As can be seen, for approximately 200 follow- 
ups, a degradation of less than 10% in the FoM requires 
Cpiior(lnrf) < 0.05. The required prior constraint on Ind de- 
pends on the number of follow-up observations; larger follow- 
up programs require even higher precision. Consequently, it 
is very likely that the systematic error in mass measurements 
will determine the efficacy of follow-up observations. 

5. OPTIMIZING X-RAY AND SZ FOLLOW-UP PROGRAMS: 
OBSERVATIONAL ISSUES 

In previous sections, we assumed that all optically selected 
clusters have an equal chance to be followed up, regardless 
of their mass and redshift. In reality, observing optically se- 
lected low-mass or high-redshift clusters may be very diffi- 
cult or eve n im possible with some methods; the optimization 
in Section 13.21 is thus impractical. For example, observing 
high-redshift clusters in X-ray requires substantially (if not 
prohibitively) more telescope time and sometimes has limited 
improvement in parameter constraints. Thus, we would like to 
optimize the follow-up strategy considering both the FoM and 
the cost of telescope time. In this section, instead of assuming 
a fixed number of follow-ups, we study how to optimize the 
follow-up strategy with limited observ ation al cost. We first 
model the observational cost in Secti on IBTTl and then demon- 
strate the optimization in Section l3T2l 

5.1. Observability and Cost Proxies 

For X-ray, we expect that a precise mass measurement re- 
quires certain photon counts; therefore, we assume that the 
telescope time for observing a cluster is inversely propor- 
tional to its flux of X-ray photons. This flux is proportional to 
Lx/D\, where Lx is the X-ray luminosity and Dl(z) is the lu- 
minosity distance. We assume a self-similar scaling relation 
from the fit of I Vikhlinin et al.1 d2009al) . L x cx M^ c E lS5 (z). 
This fit is based on the mass with overdensity 500 times the 
critical density (Msooc); while we calculate the mass function 
using overdensity 200 times the mean matter density (M200/11); 
therefor e, we convert Msqq c to M2oom using the fitting for- 
mula in iHu & Kravtsovl ( 12003b . We normalize the observa- 
tional cost such that one unit corresponds to the telescope time 
for observing a cluster of mass IQ 151 Ii~ 1 Mq at redshift 0.05. 
Measuring the mass of such a cluster to 10% accuracy using 



Yx takes approximately 0. 1 3 ks with a single XMM MOS cam- 
era (A. Mantz 2009, private communic ation^ We note tha t 
we use <jf = 0.1 for X-ray clusters (e.g.. lKravtsov et aT1l2006l) . 

For SZ, we expect that the observational time is propor- 
tional to the inverse square of signal-to-noise ratio, S/Noc 
Y I y/Ti, where Y is the integrated Compton-y parameter and 
fi is the angular size of the cluster. In virial equilibrium, 

Y oc M 5 / 3 pK 3 /D\, where p m (z) is the mean matter density 
and Da(z) is the angular diameter distance. The dependence 
on angular size comes from averaging the total cluster emis- 
sion over some number of detectors. The SZ cost proxy 

is therefore proportional to D\M^J£ n (\ +z)~ 4 . In addition, 
we exclude clusters of redshift less than 0.1; these clusters 
have large angular sizes and are contaminated by the pri- 
mary CMB anisotropy. We also exclude clusters of mass 
less than 10 h~ M@ because they are su bject to significant 
background confusion dHolder et al.l20071) . We normalize the 
observational cost such that one unit corresponds to the tele- 
scope time for observing a cluster of mass 10 15 1 Ii~ 1 Mq at red- 
shift 0.15. To observe such a cluster, it takes about 30 min- 
utes to obtain S/N=10 with the South Pole Telescope (SPT) 
(D. Marrone & B. Benson 2009, private communication). We 
use a slightly larger scatter for SZ clusters, Of = 0.2; simula- 
tions have sugge sted that this scatter may be intrinsic (e.g., 
iShaw et aT]|2008l) . and projection effects can further increase 
the scatter. Note that we only consider intrinsic scatter and 
ignore systematic errors in the mass measurements from both 
X-ray and SZ clusters. 

The top panels in Figure[5]present the mass and redshift de- 
pendence of the cost proxies. As can be seen, X-ray and SZ 
cost proxies have different patterns. The cost of X-ray clusters 
increases rapidly with redshift, while the cost of SZ clusters 
is almost constant with redshift. The latter is primarily sen- 
sitive to mass rather than redshift; thus, for SZ, high-redshift 
follow-ups are more available than low-mass ones. We will 
continue to factor in the total observational cost of a follow- 
up program, which is obtained by summing over the product 
of cluster number and cost in each bin. 

5.2. Optimizing the Follow-up Strategy at a Given 
Observational Cost 

Given limited telescope time for a follow-up program, we 
would like to find the strategy that maximiz es th e FoM. How- 
ever, our optimization algorithm in Section [3~l2l cannot be ap- 
plied directly; sampling a configuration at a given cost is not 
practical, since both the FoM and the cost depend on the con- 
figuration. We instead use a Monte Carlo approach: We sam- 
ple a configuration and find its corresponding point on the 
cost-FoM plane. After sampling many configurations, we 
can find the boundary of these points and estimate the upper 
bound of the FoM at a given cost. 

To generate these Monte Carlo p oints , we slightly modify 
the sampling algorithm in Section 13.21 At a given iVf, we 
sample ~ 10 s configurations and compute their corresponding 
cost and FoM. We then plot all these points on the cost-FoM 
plane and find the maximum of FoM at a given cost. To make 
the sampling more efficient, we modify the algorithm to max- 
imize (FoM/ln (cost)), which includes moderate dependence 
on cost. Since this objective function is not well justified, we 

10 We ignored the field-of-view limitations at low redshift when estimating 
this time scale, which is only used for order of magnitude estimates of the 
total telescope time required for a follow-up program. 
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FIG. 5. — Left: X-ray follow-ups. Right: SZ follow-ups. Top: observational cost proxies, which are proportional to the required telescope time to observe a 
cluster and are normalized to the lowest-redshift and highest-mass bin available for each method. Middle and bottom: optimal follow-up strategies for small and 
large follow-up programs. We maximize the FoM at a given total cost and show both the number and the percentage of follow-up targets in each bin. As can 
be seen, the configurations depend on the allowed cost. In general, X-ray follow-up programs favor low-redshift clusters first and extend to high-redshift as the 
allowed cost increases. On the other hand, SZ follow-ups include high mass clusters in a wide redshift range and extend toward low mass as the allowed cost 
increases. 



only use it in the sampling. However, it turns out that the con- 
figuration maximizing (FoM/ln (cost)) at a given Nf coincides 
with the boundary of the FoM at a given cost. We thus em- 
pirically propose that one can maximize (FoM/ln (cost)) at a 
given Nf to design follow-ups. 

The middle panels of Figure[5]show two examples of small 
follow-up programs involving about 150 clusters; both are the 
optimal strategies at a given cost. We present both the number 



and the percentage of follow-ups in each bin. Comparing Fig- 
ure|5]to Figure|2] we can see the impact of observational cost 
on designing follow-up strategies. For X-ray, as expected, the 
high-redshift and low-mass clusters are down-weighted be- 
cause of their high cost; instead, clusters at low redshift are 
chosen. For SZ, due to its almost redshift-independent cost, 
massive clusters with a wide redshift range are preferred. 
The bottom panels of Figure [5] present two examples of 
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FIG. 6. — Improvement in the FoM as a function of observational cost. Left: X-ray follow-ups. Right: SZ follow-ups. The lower ;t-axes correspond to 
the cost proxies presented in Figure [5] while the upper ;t-axes correspond to the observational time for two specific instruments (see the text). The dashed 
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optimized curves show changes of slopes near the stars and crosses we marked, reflecting the changes of follow-up strategies. For small follow-up programs, 
optimal follow-up strategies can reduce the cost by up to an order of magnitude for a given FoM value. On the other hand, for large follow-up programs, 
optimizing with respect to cost is not essential. In the right panel, we also mark two points relevant to the SPT survey: 1000 deg 2 (triangle) and 2000 deg 2 
(square), assuming a constant mass threshold lO 14 ' 5 /i _1 Af0. 



large follow-up programs, both of which almost double the 
FoM. As the allowed cost increases, follow-up strategies 
change. These optimal configurations now extend toward the 
high-redshift and low-mass corners, as in the cases in Figure 
|2] For X-ray, the follow-ups include a wider range of mass 
and redshift. For SZ, less-massive clusters are included, and 
the configuration still favors a wide range in redshift. 

These follow-up strategies are related to our assumptions 
that the observable-mass distribution depends on mass and 
redshift via power laws. To constrain power laws, sampling 
a range of mass and redshift is the most effective. We em- 
phasize that these follow-up strategies are targeting dark en- 
ergy constraints alone and do not comprehensively consider 
cluster science. In reality, skipping follow-ups in some bins 
may be risky for cluster science. In addition, we will need 
some follow-ups in every mass and redshift bin to test our 
power-law assumptions. After the power-law assumptions are 
justified, we can more confidently use our follow-up strate- 
gies to improve dark energy constraints. We also emphasize 
that a single instrument is assumed for the follow-up obser- 
vations. For SZ, however, clusters of different redshift ranges 
are likely to be observed with different instruments with dif- 
ferent normalizations in cost. These complications will be 
instrument specific and will change the optimization. 

Figure [6] shows the FoM improvement due to optimization 
as a function of cost for both X-ray (left panel) and SZ (right 
panel) follow-ups. The low er x- axes correspond to the cost 
proxies discussed in Section I5T1 while the upper x-axes cor- 
respond to the telescope time specific to XMM and SPT. We 
compare the optimal cases (solid curves) with the baseline 
cases (dashed curves). The baseline cases correspond to equal 
number of follow-ups in all mass and redshift bins with cost 
less than 200. The importance of optimizing follow-up strate- 
gies is abundantly clear: to achieve a specified FoM, our opti- 
mal strategies can reduce the required telescope time by about 
an order of magnitude for small follow-up programs. 

In Figure|6] the optimal cases for X-ray and SZ show differ- 
ent features. The stars and crosses mark the examples we have 
shown in Figure [5] As can be seen, for X-ray, as the allowed 
cost increases, the FoM increases less rapidly than SZ. Both 
curves show slope changes, which are caused by the changes 



in configurations, as we discuss below. 

For X-ray, when the allowed cost is below 10 4 , the most ef- 
fective strategy is to tighten the constraints on the observable- 
mass distribution in low-redshift bins (as shown in the mid- 
dle left panel in Figure [5]). When the allowed cost is high 
enough, the follow-ups can afford to constrain both low and 
high redshift bins (as shown in the bottom left panel). Since 
constraining two extreme redshift regimes gives much better 
constraints on the evolution, the slope of the FoM increases. 
Nevertheless, this slope increase is very close to the point 
where the optimal case approaches the baseline case. With 
such a large allocation of telescope time, optimization is no 
longer essential, and the uniform sampling can achieve the 
same FoM. 

For SZ, two obvious slope changes can be seen. The first 
one occurs near the cost of 1000. Below this cost, only mas- 
sive clusters are chosen (as shown in the middle right panel 
of Figure [5). Above this cost, less-massive clusters become 
affordable (as shown in the bottom right panel), and the mass 
dependence of the observable-mass distribution is better con- 
strained, leading to the slope increase of the FoM. The second 
change occurs near the cost of 10 4 , where the slope suddenly 
drops and the optimal case approaches the baseline case. At 
this point, we exhaust the information from the two redshift 
ends, and sampling the middle regime cannot make signifi- 
cant improvement. The lack of further improvement leads to 
the decrease of slope, and optimization is no longer essential 
when this amount of telescope time is available. 

Comparing X-ray and SZ, we can view the design of cost- 
effective follow-up strategies as a trade-off between constrain- 
ing mass dependence and constraining redshift evolution of 
the observable-mass distribution. When the cost is more sen- 
sitive to redshift than mass, as in the case of X-ray, one should 
prioritize the constraints on mass dependence regarding lim- 
ited telescope time. On the other hand, when the cost is more 
sensitive to mass than redshift, as in the case of SZ, redshift 
evolution should be prioritized. 

In the right panel of Figure [6] we add two points as refer- 
ences: the red triangle and the red square present the cases, 
respectively, of 1000 and 2000 deg 2 of SZ follow-ups; these 
assumptions are relevant for SPT. We assume a mass thresh- 
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FIG. 7. — Improvement in the FoM as a function of observational cost 
from X-ray follow-ups in addition to 2000 deg- SZ observations above 
lO 14 ' 5 /r'M0. These follow-up strategies all focus on low-redshift and low- 
mass clusters in order to provide complementary information to SZ follow- 
ups. We note that in this regime, the improvement in the FoM is less signifi- 
cant than previous cases because the fiducial FoM already includes substan- 
tial information from follow-ups. 

old of lO 14 5 /z _1 M0, which roughly corresponds to the mass 
threshold of the ongoing SPT survey (L. Shaw & B. Benson 
2009, private communication). However, here we assume that 
the follow-up mass measurements have a constant scatter and 
no systematic error. In reality, the SPT survey may not have 
the same precision in mass measurements for all clusters, and 
degradation due to inaccurate mass estimates is possible. 

Finally, we study the complementarity between X-ray and 
SZ follow-ups. We assume that 2000 deg 2 of the survey field 
is followed up by SZ with a mass threshold of IO^ VMq. 
We then optimize the X-ray follow-ups for clusters that are not 
observed by SZ. In Figure [7] we show the FoM improvement 
due to these additional X-ray follow-ups. The fiducial FoM 
corresponds to a DES-like survey with 2000deg 2 SZ follow- 
ups, and the baseline case corresponds to evenly selected tar- 
gets with cost below 200. To provide complementary infor- 
mation to SZ follow-ups, the optimal X-ray follow-up strate- 
gies would focus on low-redshift and low-mass clusters. We 
note that since the fiducial FoM already includes the informa- 
tion from SZ follow-ups, the improvement in the FoM is less 
significant than that in Figure [6] 

In this calculation, we assume the SZ follow-up mass mea- 
surements have 20% scatter and no systematic error. This 
is an optimistic assumption for SPT; in reality, clusters ob- 
served by SPT may have larger scatter and more complicated 
sources of systematic error. Therefore, our calculation only 
represents a limiting case. On the other hand, our calculation 
assumes that X-ray and SZ follow-ups are not overlapping. In 
reality, some clusters will be observed with both methods. In 
principle, one can calculate the combined information from 
both X-ray and SZ follow-ups and optimize follow-up strate- 
gies by including these doubly followed-up clusters. How- 
ever, since we assume very accurate mass inference for SZ 
clusters, X-ray observations will add little to it. In addition, 
computing the information from both follow-up methods will 
require modeling the full covariance between optical, X-ray, 
and SZ observables, which is beyond the scope of the current 
work. 

6. DISCUSSION 



In Section l4~Tl we studied the impact of the correlation be- 
tween different mass proxies. In principle, since the true mass 
is not observable, p cannot be directly measured. However, 
this correlation can be studied using consistency of scaling re- 
lations for different mass proxies. For example. iRykoff et al.1 
(120081) studied the scaling relation of the mean X-ray luminos- 
ity and mean weak lensing mass for optically selected sam- 
ples, which are binned by optical richness. They compared 
this scaling relation with the one derived from X-ray selected 
samples and found that the correlation between X-ray lumi- 
nosity and optical richness is consistent with zero. Although 
these authors did not provide constraints on the correlation, 
they demonstrated an effective way to study it. In addition, 
iRozo et ail (l2009al) used a similar analysis to obtain the con- 
straints on the correlation between X-ray luminosity and mass 
for a given optical richness, finding Pl x ,m\n > 0.85. 

This correlation can also be studied with si mulations. For 
example, the results of IWechsler et al.1 (120061) imply a slight 
anti-correlation: at a given halo mass, h alos with high concen- 
tration have lower richness on average. lCohn& Whitel<l2009l) 
studied the joint SZ and optical cluster finding in simulations. 
They demonstrated that the cluster mass estimates from op- 
tical richness and SZ flux are positively correlated. Detailed 
comparisons for differen t mass proxies, how ever, will require 
further explo ratio n (e.g., Stanek et al. 1 20091) . 

In Section 14.21 we studied the impact of the systematic 
error of follow-up mass measurements. Different follow-up 
methods have different sources of systematic error; here we 
compar e several di f ferent mass proxies studied in the liter- 
ature. iNagai et aD (120071) simulated the X-ray mass mea- 
surements and found that the total cluster mass derived from 
hydrostatic equilibrium is systematically l ower than the true 
mass by about 5%- 20% (also see, e.g., iRasia et ail 120061: 
iMahdavi et al.l 12008). They also found that the deviation is 
less significant for relaxed systems and for the inner regions of 
the clusters. This underestimate can be attributed to the non- 
hydrostatic state of the intracl uster medium that provides ad - 
ditional pressure support (e.g. jEvrar 51[T9MlLWeTani2009l) . 
On the other hand, they found that the estimate of the mass of 
th e intracluster medium ( M pas ) is robust. 

IVikhlinin et al.1 (1200 9a) used multiple X-ray indicators, in- 
cluding M gas , the temperature Tx, and estimated total thermal 
energy Yx = M gas x Tx to calibrate the cluster mass. These 
mass indicators have been shown by simula tions to have a 
tight sc aling relation w i th the t otal mass (e.g.. lKravtsov et ail 
120061) . IVikhlinin et al.1 (l2009al) also calibrated the total mass 
with low-redshift samples and cross-checked it with the weak 
lensing results. Therefore, X-ray clusters, when carefully cal- 
ibrated, are likely to provide the most robust mass proxy and 
the most ideal method for follow-up observations. 

On the other hand, SZ observations are still limited by 
statistics and have few observatio nal studies on the scaling 
relation and the cluster profile (e.g.. lMroczkowski et al.l2009t 
iBonamente et al. 2008)). T heir utility is thus yet to be fully 
demonstrated. In addition, iRudd & Nagail (|2009) simulated 
the two-temperature model for clusters found that the non- 
equipartition of electrons and ions may lead to 5% underesti- 
mate of the mass derived from SZ. Nevertheless, simulations 
have shown that the scatter of the SZ mass proxy is small. If 
the systematic error can be well constrained, SZ follow-ups 
may become very influential given their statistical power in 
the near future. 

Another possibility of follow-up observations is weak lens- 
ing mass measurements of individual clusters. iHoekstral 
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( 120071) and Zhang et al. (2008) compared the mass measure- 
ments from weak lensing and X-ray, finding good agreement. 
However, weak lensing mass measurements usually have 20% 
uncertainties due to projection along the line of sight, and 
the current statistics are still low. In addition, detailed un- 
derstanding of the photometric redshift properties of source 
galaxies is required to avoid systematic error in the reco vered 
weak lensing mass (see e.g.,|Mandelbaum et al. 2 008bl) . 

Stacked weak lensing analysis has been used to measure 
the mean mass of the clusters for a given optical r ichness 
(e.g.. Uohnston et al.|[2007t iMandelbaum et al.ll2008al) . This 
method does not suffer from projection effects caused by un- 
correlated structure and allows one to estimate the mass of 
low-mass clusters, for which individual weak lensing cannot 
be detected. However, the stacked analysis cannot provide the 
variance of mass, which, as we have shown, contains impor- 
tant information. If the variance of mass can be determined 
using this method in the future (for example, by resampling), 
the constraining power of this method will be improved. 

Finally, we note that one caveat of our results is the as- 
sumption that the optical richness-mass distribution is well 
described by a log-normal distribution and the scaling and 
scatter follow power laws. The validity of these assumptions 
will need to be tested explicitly with both simulations and ob- 
servations. 

7. CONCLUSIONS 

We studied the impacts of follow-up observations — more 
precise measurements of cluster mass — on the constraining 
power of large optical cluster surveys. Considering the self- 
calibrated cluster abundance data from the DES, we demon- 
strated that the dark energy FoM can be significantly im- 
proved. Our primary findings are 

1. Optimal target-selection strategies are essential for 
maximizing the power of modestly sized follow-up pro- 
grams. For instance, 100 optimally selected follow-ups 
can improve the FoM of a DES-like survey by up to 
76.4%, which is compared to a 40.3% improvement due 
to evenly selected follow-ups. Random sub-sampling 
of the cluster catalog is even less effective. Gener- 
ally speaking, one should always follow up low -redshift 
clusters first, and then extend to t he highe r-redshift and 
lower-mass regime (Sections l3.1l and [3T2i l. 

2. The scatter of the follow-up mass proxy and the co- 
variance between the optical richness and the follow- 
up mass proxy have only modest effects on the FoM, 
provided that the follow-up mass proxy has sufficiently 
small scatter. On the other hand, although lowering the 
scatter of optical richness does not change the baseline 
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self-calibration results, it will s igni ficantly enhance the 
efficacy of follow-ups (Section l4~TI ). 

3. Systematic error in follow-up mass measurements 
should be controlled at the 5% level to avoid severe 
degradation. In addition, if only the mean of cluster 
mass is measured, the systematic error of follow-up 
mass proxy will be degenerate with the systematic error 
of optical richness; measuring the vari ance of cluster 
mass can break this degeneracy (Section l4~2l ). 

4. We explored observational issues to propose more prac- 
tical X-ray and SZ follow-up programs. The observa- 
tional costs of X-ray and SZ are, respectively, sensi- 
tive to redshift and mass, which in turn leads to differ- 
ent follow-up strategies. To achieve 50% improvement 
in the FoM, the most cost-effective follow-up strategy 
involves approximately 200 low-redshift X-ray clus- 
ters or massive SZ clusters. In general, our optimal 
strategies can reduce the observational cost required to 
achieve given dark energy constraints by up to an order 
of magnitude (Section[5]). 

A follow-up mass tracer that has systematic error under- 
stood at the 5% level will substantially benefit optical cluster 
surveys. On the other hand, reducing the scatter of optical 
mass tracer will significantly improve the efficacy of optically 
selected follow-ups. Current observational resources allow a 
few hundred low-redshift X-ray clusters, and in the near future 
hundreds or thousands of SZ clusters will become available. 
Therefore, detailed follow-up studies of a small but optimally- 
selected cluster sub-sample have the potential to be a powerful 
complement to current and imminent cluster surveys. 
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